Overview

Dataset statistics

Number of variables35
Number of observations1029
Missing cells268
Missing cells (%)0.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory808.4 KiB
Average record size in memory804.5 B

Variable types

NUM17
CAT15
BOOL3

Warnings

EmployeeCount has constant value "1029" Constant
Over18 has constant value "1029" Constant
StandardHours has constant value "1029" Constant
MonthlyIncome is highly correlated with JobLevelHigh correlation
JobLevel is highly correlated with MonthlyIncomeHigh correlation
JobRole is highly correlated with DepartmentHigh correlation
Department is highly correlated with JobRoleHigh correlation
Age has 136 (13.2%) missing values Missing
DailyRate has 27 (2.6%) missing values Missing
DistanceFromHome has 95 (9.2%) missing values Missing
EmployeeNumber has unique values Unique
NumCompaniesWorked has 138 (13.4%) zeros Zeros
TrainingTimesLastYear has 39 (3.8%) zeros Zeros
YearsAtCompany has 31 (3.0%) zeros Zeros
YearsInCurrentRole has 169 (16.4%) zeros Zeros
YearsSinceLastPromotion has 415 (40.3%) zeros Zeros
YearsWithCurrManager has 188 (18.3%) zeros Zeros

Reproduction

Analysis started2020-12-16 23:59:32.803695
Analysis finished2020-12-17 00:00:27.470820
Duration54.67 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

Age
Real number (ℝ≥0)

MISSING

Distinct39
Distinct (%)4.4%
Missing136
Missing (%)13.2%
Infinite0
Infinite (%)0.0%
Mean37.93057111
Minimum18
Maximum60
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum18
5-th percentile23
Q131
median37
Q344
95-th percentile55
Maximum60
Range42
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.395978496
Coefficient of variation (CV)0.2477151865
Kurtosis-0.555578353
Mean37.93057111
Median Absolute Deviation (MAD)6
Skewness0.2372086586
Sum33872
Variance88.28441189
MonotocityNot monotonic
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%) 
29494.8%
 
36484.7%
 
34484.7%
 
31474.6%
 
30424.1%
 
33424.1%
 
32424.1%
 
40393.8%
 
38383.7%
 
27373.6%
 
42353.4%
 
41323.1%
 
37302.9%
 
45292.8%
 
39272.6%
 
43262.5%
 
46222.1%
 
50212.0%
 
55191.8%
 
51171.7%
 
24171.7%
 
44161.6%
 
53151.5%
 
52151.5%
 
48141.4%
 
Other values (14)12612.2%
 
(Missing)13613.2%
 
ValueCountFrequency (%) 
1850.5%
 
1970.7%
 
20101.0%
 
2180.8%
 
22131.3%
 
23121.2%
 
24171.7%
 
27373.6%
 
29494.8%
 
30424.1%
 
ValueCountFrequency (%) 
6030.3%
 
5970.7%
 
5880.8%
 
5730.3%
 
56101.0%
 
55191.8%
 
54131.3%
 
53151.5%
 
52151.5%
 
51171.7%
 

Attrition
Boolean

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
No
853 
Yes
176 
ValueCountFrequency (%) 
No85382.9%
 
Yes17617.1%
 

BusinessTravel
Categorical

Distinct3
Distinct (%)0.3%
Missing5
Missing (%)0.5%
Memory size8.2 KiB
Travel_Rarely
723 
Travel_Frequently
199 
Non-Travel
102 
ValueCountFrequency (%) 
Travel_Rarely72370.3%
 
Travel_Frequently19919.3%
 
Non-Travel1029.9%
 
(Missing)50.5%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length17
Median length13
Mean length13.42759961
Min length3

Overview of Unicode Properties

Unique unicode characters17
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e214515.5%
 
r194614.1%
 
l194614.1%
 
a175212.7%
 
T10247.4%
 
v10247.4%
 
_9226.7%
 
y9226.7%
 
R7235.2%
 
n3112.3%
 
F1991.4%
 
q1991.4%
 
u1991.4%
 
t1991.4%
 
N1020.7%
 
o1020.7%
 
-1020.7%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter1074577.8%
 
Uppercase Letter204814.8%
 
Connector Punctuation9226.7%
 
Dash Punctuation1020.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
T102450.0%
 
R72335.3%
 
F1999.7%
 
N1025.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e214520.0%
 
r194618.1%
 
l194618.1%
 
a175216.3%
 
v10249.5%
 
y9228.6%
 
n3112.9%
 
q1991.9%
 
u1991.9%
 
t1991.9%
 
o1020.9%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_922100.0%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-102100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin1279392.6%
 
Common10247.4%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e214516.8%
 
r194615.2%
 
l194615.2%
 
a175213.7%
 
T10248.0%
 
v10248.0%
 
y9227.2%
 
R7235.7%
 
n3112.4%
 
F1991.6%
 
q1991.6%
 
u1991.6%
 
t1991.6%
 
N1020.8%
 
o1020.8%
 

Most frequent Common characters

ValueCountFrequency (%) 
_92290.0%
 
-10210.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII13817100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e214515.5%
 
r194614.1%
 
l194614.1%
 
a175212.7%
 
T10247.4%
 
v10247.4%
 
_9226.7%
 
y9226.7%
 
R7235.2%
 
n3112.3%
 
F1991.4%
 
q1991.4%
 
u1991.4%
 
t1991.4%
 
N1020.7%
 
o1020.7%
 
-1020.7%
 

DailyRate
Real number (ℝ≥0)

MISSING

Distinct692
Distinct (%)69.1%
Missing27
Missing (%)2.6%
Infinite0
Infinite (%)0.0%
Mean800.5289421
Minimum102
Maximum1496
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum102
5-th percentile157.15
Q1458.25
median801.5
Q31162
95-th percentile1421.95
Maximum1496
Range1394
Interquartile range (IQR)703.75

Descriptive statistics

Standard deviation408.1098284
Coefficient of variation (CV)0.509800217
Kurtosis-1.221174387
Mean800.5289421
Median Absolute Deviation (MAD)354.5
Skewness-0.003923162652
Sum802130
Variance166553.632
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
132940.4%
 
26740.4%
 
108240.4%
 
32940.4%
 
69140.4%
 
11740.4%
 
53040.4%
 
89030.3%
 
93030.3%
 
95930.3%
 
81330.3%
 
101730.3%
 
109730.3%
 
97730.3%
 
95030.3%
 
140430.3%
 
30330.3%
 
33730.3%
 
114630.3%
 
127730.3%
 
127630.3%
 
144830.3%
 
140030.3%
 
13530.3%
 
94530.3%
 
Other values (667)92089.4%
 
(Missing)272.6%
 
ValueCountFrequency (%) 
10210.1%
 
10310.1%
 
10510.1%
 
10710.1%
 
10910.1%
 
11120.2%
 
11620.2%
 
11740.4%
 
11810.1%
 
11910.1%
 
ValueCountFrequency (%) 
149610.1%
 
149520.2%
 
149210.1%
 
149010.1%
 
148810.1%
 
148530.3%
 
148210.1%
 
148020.2%
 
147910.1%
 
147630.3%
 

Department
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
Research & Development
676 
Sales
311 
Human Resources
 
42
ValueCountFrequency (%) 
Research & Development67665.7%
 
Sales31130.2%
 
Human Resources424.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length22
Median length22
Mean length16.57628766
Min length5

Overview of Unicode Properties

Unique unicode characters20
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e377522.1%
 
13948.2%
 
s10716.3%
 
a10296.0%
 
l9875.8%
 
R7184.2%
 
r7184.2%
 
c7184.2%
 
o7184.2%
 
m7184.2%
 
n7184.2%
 
h6764.0%
 
&6764.0%
 
D6764.0%
 
v6764.0%
 
p6764.0%
 
t6764.0%
 
S3111.8%
 
u840.5%
 
H420.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter1324077.6%
 
Uppercase Letter174710.2%
 
Space Separator13948.2%
 
Other Punctuation6764.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
R71841.1%
 
D67638.7%
 
S31117.8%
 
H422.4%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e377528.5%
 
s10718.1%
 
a10297.8%
 
l9877.5%
 
r7185.4%
 
c7185.4%
 
o7185.4%
 
m7185.4%
 
n7185.4%
 
h6765.1%
 
v6765.1%
 
p6765.1%
 
t6765.1%
 
u840.6%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
1394100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
&676100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin1498787.9%
 
Common207012.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e377525.2%
 
s10717.1%
 
a10296.9%
 
l9876.6%
 
R7184.8%
 
r7184.8%
 
c7184.8%
 
o7184.8%
 
m7184.8%
 
n7184.8%
 
h6764.5%
 
D6764.5%
 
v6764.5%
 
p6764.5%
 
t6764.5%
 
S3112.1%
 
u840.6%
 
H420.3%
 

Most frequent Common characters

ValueCountFrequency (%) 
139467.3%
 
&67632.7%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII17057100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e377522.1%
 
13948.2%
 
s10716.3%
 
a10296.0%
 
l9875.8%
 
R7184.2%
 
r7184.2%
 
c7184.2%
 
o7184.2%
 
m7184.2%
 
n7184.2%
 
h6764.0%
 
&6764.0%
 
D6764.0%
 
v6764.0%
 
p6764.0%
 
t6764.0%
 
S3111.8%
 
u840.5%
 
H420.2%
 

DistanceFromHome
Real number (ℝ≥0)

MISSING

Distinct27
Distinct (%)2.9%
Missing95
Missing (%)9.2%
Infinite0
Infinite (%)0.0%
Mean9.930406852
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median8
Q316
95-th percentile26
Maximum29
Range28
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.421790527
Coefficient of variation (CV)0.8480811161
Kurtosis-0.565511992
Mean9.930406852
Median Absolute Deviation (MAD)6
Skewness0.7847258433
Sum9275
Variance70.92655568
MonotocityNot monotonic
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%) 
114714.3%
 
214113.7%
 
9686.6%
 
8575.5%
 
10575.5%
 
7535.2%
 
4504.9%
 
6434.2%
 
26242.3%
 
18222.1%
 
16222.1%
 
11212.0%
 
25201.9%
 
24191.8%
 
29181.7%
 
12171.7%
 
23171.7%
 
28171.7%
 
19161.6%
 
22161.6%
 
14151.5%
 
15151.5%
 
17141.4%
 
13121.2%
 
21121.2%
 
Other values (2)212.0%
 
(Missing)959.2%
 
ValueCountFrequency (%) 
114714.3%
 
214113.7%
 
4504.9%
 
6434.2%
 
7535.2%
 
8575.5%
 
9686.6%
 
10575.5%
 
11212.0%
 
12171.7%
 
ValueCountFrequency (%) 
29181.7%
 
28171.7%
 
27101.0%
 
26242.3%
 
25201.9%
 
24191.8%
 
23171.7%
 
22161.6%
 
21121.2%
 
20111.1%
 

Education
Real number (ℝ≥0)

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.89212828
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile4
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.053540556
Coefficient of variation (CV)0.3642786399
Kurtosis-0.6487944368
Mean2.89212828
Median Absolute Deviation (MAD)1
Skewness-0.2737624588
Sum2976
Variance1.109947703
MonotocityNot monotonic
Histogram with fixed size bins (bins=5)
ValueCountFrequency (%) 
338637.5%
 
427927.1%
 
219418.9%
 
113413.0%
 
5363.5%
 
ValueCountFrequency (%) 
113413.0%
 
219418.9%
 
338637.5%
 
427927.1%
 
5363.5%
 
ValueCountFrequency (%) 
5363.5%
 
427927.1%
 
338637.5%
 
219418.9%
 
113413.0%
 

EducationField
Categorical

Distinct6
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
Life Sciences
426 
Medical
328 
Marketing
110 
Technical Degree
82 
Other
66 
ValueCountFrequency (%) 
Life Sciences42641.4%
 
Medical32831.9%
 
Marketing11010.7%
 
Technical Degree828.0%
 
Other666.4%
 
Human Resources171.7%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length16
Median length13
Mean length10.41885326
Min length5

Overview of Unicode Properties

Unique unicode characters26
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e214420.0%
 
i137212.8%
 
c136112.7%
 
n6355.9%
 
a5375.0%
 
5254.9%
 
s4604.3%
 
M4384.1%
 
L4264.0%
 
f4264.0%
 
S4264.0%
 
l4103.8%
 
d3283.1%
 
r2752.6%
 
g1921.8%
 
t1761.6%
 
h1481.4%
 
k1101.0%
 
T820.8%
 
D820.8%
 
O660.6%
 
u340.3%
 
H170.2%
 
m170.2%
 
R170.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter864280.6%
 
Uppercase Letter155414.5%
 
Space Separator5254.9%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M43828.2%
 
L42627.4%
 
S42627.4%
 
T825.3%
 
D825.3%
 
O664.2%
 
H171.1%
 
R171.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e214424.8%
 
i137215.9%
 
c136115.7%
 
n6357.3%
 
a5376.2%
 
s4605.3%
 
f4264.9%
 
l4104.7%
 
d3283.8%
 
r2753.2%
 
g1922.2%
 
t1762.0%
 
h1481.7%
 
k1101.3%
 
u340.4%
 
m170.2%
 
o170.2%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
525100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin1019695.1%
 
Common5254.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e214421.0%
 
i137213.5%
 
c136113.3%
 
n6356.2%
 
a5375.3%
 
s4604.5%
 
M4384.3%
 
L4264.2%
 
f4264.2%
 
S4264.2%
 
l4104.0%
 
d3283.2%
 
r2752.7%
 
g1921.9%
 
t1761.7%
 
h1481.5%
 
k1101.1%
 
T820.8%
 
D820.8%
 
O660.6%
 
u340.3%
 
H170.2%
 
m170.2%
 
R170.2%
 
o170.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
525100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII10721100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e214420.0%
 
i137212.8%
 
c136112.7%
 
n6355.9%
 
a5375.0%
 
5254.9%
 
s4604.3%
 
M4384.1%
 
L4264.0%
 
f4264.0%
 
S4264.0%
 
l4103.8%
 
d3283.1%
 
r2752.6%
 
g1921.8%
 
t1761.6%
 
h1481.4%
 
k1101.0%
 
T820.8%
 
D820.8%
 
O660.6%
 
u340.3%
 
H170.2%
 
m170.2%
 
R170.2%
 

EmployeeCount
Boolean

CONSTANT
REJECTED

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
1
1029 
ValueCountFrequency (%) 
11029100.0%
 

EmployeeNumber
Real number (ℝ≥0)

UNIQUE

Distinct1029
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1024.367347
Minimum1
Maximum2068
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum1
5-th percentile94.4
Q1496
median1019
Q31553
95-th percentile1971.6
Maximum2068
Range2067
Interquartile range (IQR)1057

Descriptive statistics

Standard deviation606.3016348
Coefficient of variation (CV)0.5918791111
Kurtosis-1.206539811
Mean1024.367347
Median Absolute Deviation (MAD)530
Skewness0.02147841012
Sum1054074
Variance367601.6723
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
204510.1%
 
74110.1%
 
66410.1%
 
66310.1%
 
66210.1%
 
66110.1%
 
65910.1%
 
65710.1%
 
65310.1%
 
65210.1%
 
64810.1%
 
64410.1%
 
64310.1%
 
64110.1%
 
63910.1%
 
63810.1%
 
63510.1%
 
63210.1%
 
63110.1%
 
63010.1%
 
62610.1%
 
62510.1%
 
62410.1%
 
62210.1%
 
62010.1%
 
Other values (1004)100497.6%
 
ValueCountFrequency (%) 
110.1%
 
210.1%
 
410.1%
 
510.1%
 
710.1%
 
810.1%
 
1010.1%
 
1110.1%
 
1210.1%
 
1310.1%
 
ValueCountFrequency (%) 
206810.1%
 
206410.1%
 
206210.1%
 
206110.1%
 
206010.1%
 
205610.1%
 
205410.1%
 
205310.1%
 
205110.1%
 
204910.1%
 
Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
3
310 
4
300 
2
212 
1
207 
ValueCountFrequency (%) 
331030.1%
 
430029.2%
 
221220.6%
 
120720.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Overview of Unicode Properties

Unique unicode characters4
Unique unicode categories1 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
331030.1%
 
430029.2%
 
221220.6%
 
120720.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number1029100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
331030.1%
 
430029.2%
 
221220.6%
 
120720.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Common1029100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
331030.1%
 
430029.2%
 
221220.6%
 
120720.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1029100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
331030.1%
 
430029.2%
 
221220.6%
 
120720.1%
 

Gender
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
Male
617 
Female
412 
ValueCountFrequency (%) 
Male61760.0%
 
Female41240.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length4
Mean length4.800777454
Min length4

Overview of Unicode Properties

Unique unicode characters6
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e144129.2%
 
a102920.8%
 
l102920.8%
 
M61712.5%
 
F4128.3%
 
m4128.3%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter391179.2%
 
Uppercase Letter102920.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M61760.0%
 
F41240.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e144136.8%
 
a102926.3%
 
l102926.3%
 
m41210.5%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin4940100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e144129.2%
 
a102920.8%
 
l102920.8%
 
M61712.5%
 
F4128.3%
 
m4128.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII4940100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e144129.2%
 
a102920.8%
 
l102920.8%
 
M61712.5%
 
F4128.3%
 
m4128.3%
 

HourlyRate
Real number (ℝ≥0)

Distinct71
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean66.68027211
Minimum30
Maximum100
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum30
5-th percentile34
Q148
median67
Q384
95-th percentile97.6
Maximum100
Range70
Interquartile range (IQR)36

Descriptive statistics

Standard deviation20.47409414
Coefficient of variation (CV)0.307048749
Kurtosis-1.212043515
Mean66.68027211
Median Absolute Deviation (MAD)18
Skewness-0.08874675179
Sum68614
Variance419.1885307
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
48252.4%
 
79242.3%
 
98232.2%
 
87232.2%
 
72201.9%
 
66201.9%
 
84201.9%
 
52191.8%
 
54191.8%
 
92191.8%
 
97191.8%
 
82181.7%
 
61181.7%
 
60181.7%
 
46181.7%
 
43181.7%
 
42171.7%
 
99171.7%
 
56171.7%
 
73161.6%
 
45161.6%
 
57161.6%
 
75161.6%
 
86161.6%
 
95161.6%
 
Other values (46)56154.5%
 
ValueCountFrequency (%) 
30141.4%
 
3190.9%
 
32141.4%
 
33131.3%
 
34111.1%
 
35121.2%
 
36121.2%
 
37141.4%
 
3890.9%
 
39121.2%
 
ValueCountFrequency (%) 
100121.2%
 
99171.7%
 
98232.2%
 
97191.8%
 
96151.5%
 
95161.6%
 
94151.5%
 
93131.3%
 
92191.8%
 
91141.4%
 

JobInvolvement
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
3
606 
2
269 
4
94 
1
 
60
ValueCountFrequency (%) 
360658.9%
 
226926.1%
 
4949.1%
 
1605.8%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Overview of Unicode Properties

Unique unicode characters4
Unique unicode categories1 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
360658.9%
 
226926.1%
 
4949.1%
 
1605.8%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number1029100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
360658.9%
 
226926.1%
 
4949.1%
 
1605.8%
 

Most occurring scripts

ValueCountFrequency (%) 
Common1029100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
360658.9%
 
226926.1%
 
4949.1%
 
1605.8%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1029100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
360658.9%
 
226926.1%
 
4949.1%
 
1605.8%
 

JobLevel
Real number (ℝ≥0)

HIGH CORRELATION

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.043731778
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile4
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.118917656
Coefficient of variation (CV)0.5474875266
Kurtosis0.3226034541
Mean2.043731778
Median Absolute Deviation (MAD)1
Skewness1.024196627
Sum2103
Variance1.251976722
MonotocityNot monotonic
Histogram with fixed size bins (bins=5)
ValueCountFrequency (%) 
140339.2%
 
235034.0%
 
315114.7%
 
4787.6%
 
5474.6%
 
ValueCountFrequency (%) 
140339.2%
 
235034.0%
 
315114.7%
 
4787.6%
 
5474.6%
 
ValueCountFrequency (%) 
5474.6%
 
4787.6%
 
315114.7%
 
235034.0%
 
140339.2%
 

JobRole
Categorical

HIGH CORRELATION

Distinct9
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
Sales Executive
217 
Research Scientist
214 
Laboratory Technician
179 
Manufacturing Director
95 
Healthcare Representative
89 
Other values (4)
235 
ValueCountFrequency (%) 
Sales Executive21721.1%
 
Research Scientist21420.8%
 
Laboratory Technician17917.4%
 
Manufacturing Director959.2%
 
Healthcare Representative898.6%
 
Manager737.1%
 
Sales Representative666.4%
 
Research Director626.0%
 
Human Resources343.3%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length25
Median length18
Mean length18.05247813
Min length7

Overview of Unicode Properties

Unique unicode characters29
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e275814.8%
 
a17999.7%
 
t14757.9%
 
c14407.8%
 
i14107.6%
 
r13947.5%
 
n10245.5%
 
s9965.4%
 
9565.1%
 
o5493.0%
 
h5442.9%
 
S4972.7%
 
u4752.6%
 
R4652.5%
 
l3722.0%
 
v3722.0%
 
E2171.2%
 
x2171.2%
 
L1791.0%
 
b1791.0%
 
y1791.0%
 
T1791.0%
 
M1680.9%
 
g1680.9%
 
D1570.8%
 
Other values (4)4072.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter1563584.2%
 
Uppercase Letter198510.7%
 
Space Separator9565.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
S49725.0%
 
R46523.4%
 
E21710.9%
 
L1799.0%
 
T1799.0%
 
M1688.5%
 
D1577.9%
 
H1236.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e275817.6%
 
a179911.5%
 
t14759.4%
 
c14409.2%
 
i14109.0%
 
r13948.9%
 
n10246.5%
 
s9966.4%
 
o5493.5%
 
h5443.5%
 
u4753.0%
 
l3722.4%
 
v3722.4%
 
x2171.4%
 
b1791.1%
 
y1791.1%
 
g1681.1%
 
p1551.0%
 
f950.6%
 
m340.2%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
956100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin1762094.9%
 
Common9565.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e275815.7%
 
a179910.2%
 
t14758.4%
 
c14408.2%
 
i14108.0%
 
r13947.9%
 
n10245.8%
 
s9965.7%
 
o5493.1%
 
h5443.1%
 
S4972.8%
 
u4752.7%
 
R4652.6%
 
l3722.1%
 
v3722.1%
 
E2171.2%
 
x2171.2%
 
L1791.0%
 
b1791.0%
 
y1791.0%
 
T1791.0%
 
M1681.0%
 
g1681.0%
 
D1570.9%
 
p1550.9%
 
Other values (3)2521.4%
 

Most frequent Common characters

ValueCountFrequency (%) 
956100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII18576100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e275814.8%
 
a17999.7%
 
t14757.9%
 
c14407.8%
 
i14107.6%
 
r13947.5%
 
n10245.5%
 
s9965.4%
 
9565.1%
 
o5493.0%
 
h5442.9%
 
S4972.7%
 
u4752.6%
 
R4652.5%
 
l3722.0%
 
v3722.0%
 
E2171.2%
 
x2171.2%
 
L1791.0%
 
b1791.0%
 
y1791.0%
 
T1791.0%
 
M1680.9%
 
g1680.9%
 
D1570.8%
 
Other values (4)4072.2%
 

JobSatisfaction
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
4
315 
3
301 
2
215 
1
198 
ValueCountFrequency (%) 
431530.6%
 
330129.3%
 
221520.9%
 
119819.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Overview of Unicode Properties

Unique unicode characters4
Unique unicode categories1 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
431530.6%
 
330129.3%
 
221520.9%
 
119819.2%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number1029100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
431530.6%
 
330129.3%
 
221520.9%
 
119819.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Common1029100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
431530.6%
 
330129.3%
 
221520.9%
 
119819.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1029100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
431530.6%
 
330129.3%
 
221520.9%
 
119819.2%
 

MaritalStatus
Categorical

Distinct3
Distinct (%)0.3%
Missing5
Missing (%)0.5%
Memory size8.2 KiB
Married
474 
Single
320 
Divorced
230 
ValueCountFrequency (%) 
Married47446.1%
 
Single32031.1%
 
Divorced23022.4%
 
(Missing)50.5%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length8
Median length7
Mean length6.893100097
Min length3

Overview of Unicode Properties

Unique unicode characters14
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
r117816.6%
 
i102414.4%
 
e102414.4%
 
d7049.9%
 
a4796.8%
 
M4746.7%
 
n3304.7%
 
S3204.5%
 
g3204.5%
 
l3204.5%
 
D2303.2%
 
v2303.2%
 
o2303.2%
 
c2303.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter606985.6%
 
Uppercase Letter102414.4%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M47446.3%
 
S32031.2%
 
D23022.5%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
r117819.4%
 
i102416.9%
 
e102416.9%
 
d70411.6%
 
a4797.9%
 
n3305.4%
 
g3205.3%
 
l3205.3%
 
v2303.8%
 
o2303.8%
 
c2303.8%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin7093100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
r117816.6%
 
i102414.4%
 
e102414.4%
 
d7049.9%
 
a4796.8%
 
M4746.7%
 
n3304.7%
 
S3204.5%
 
g3204.5%
 
l3204.5%
 
D2303.2%
 
v2303.2%
 
o2303.2%
 
c2303.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII7093100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
r117816.6%
 
i102414.4%
 
e102414.4%
 
d7049.9%
 
a4796.8%
 
M4746.7%
 
n3304.7%
 
S3204.5%
 
g3204.5%
 
l3204.5%
 
D2303.2%
 
v2303.2%
 
o2303.2%
 
c2303.2%
 

MonthlyIncome
Real number (ℝ≥0)

HIGH CORRELATION

Distinct963
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6449.79689
Minimum1009
Maximum19999
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum1009
5-th percentile2088.4
Q12814
median4735
Q38446
95-th percentile17623.6
Maximum19999
Range18990
Interquartile range (IQR)5632

Descriptive statistics

Standard deviation4794.525367
Coefficient of variation (CV)0.7433606745
Kurtosis0.8623949801
Mean6449.79689
Median Absolute Deviation (MAD)2107
Skewness1.347579672
Sum6636841
Variance22987473.49
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
234230.3%
 
245130.3%
 
261030.3%
 
240430.3%
 
634730.3%
 
627220.2%
 
265720.2%
 
556220.2%
 
556120.2%
 
213220.2%
 
212720.2%
 
255920.2%
 
303820.2%
 
490720.2%
 
299620.2%
 
522820.2%
 
210920.2%
 
272020.2%
 
245020.2%
 
243620.2%
 
234020.2%
 
402520.2%
 
269320.2%
 
237220.2%
 
534620.2%
 
Other values (938)97494.7%
 
ValueCountFrequency (%) 
100910.1%
 
105210.1%
 
108110.1%
 
109110.1%
 
110210.1%
 
111810.1%
 
112910.1%
 
120010.1%
 
122310.1%
 
126110.1%
 
ValueCountFrequency (%) 
1999910.1%
 
1997310.1%
 
1994310.1%
 
1992610.1%
 
1985910.1%
 
1984710.1%
 
1984510.1%
 
1983310.1%
 
1971710.1%
 
1970110.1%
 

MonthlyRate
Real number (ℝ≥0)

Distinct1010
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14250.62974
Minimum2094
Maximum26999
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum2094
5-th percentile3423.8
Q17950
median14295
Q320392
95-th percentile25374
Maximum26999
Range24905
Interquartile range (IQR)12442

Descriptive statistics

Standard deviation7088.757938
Coefficient of variation (CV)0.4974347147
Kurtosis-1.209325633
Mean14250.62974
Median Absolute Deviation (MAD)6194
Skewness0.02185778605
Sum14663898
Variance50250489.1
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1937320.2%
 
774420.2%
 
465820.2%
 
2210220.2%
 
732420.2%
 
1173720.2%
 
1615420.2%
 
1531820.2%
 
2444420.2%
 
2153420.2%
 
1285820.2%
 
535520.2%
 
1235520.2%
 
912920.2%
 
2028420.2%
 
2207420.2%
 
2532620.2%
 
422320.2%
 
333920.2%
 
2321310.1%
 
2526510.1%
 
407710.1%
 
1775910.1%
 
1091910.1%
 
887010.1%
 
Other values (985)98595.7%
 
ValueCountFrequency (%) 
209410.1%
 
209710.1%
 
211210.1%
 
212210.1%
 
212510.1%
 
213710.1%
 
222710.1%
 
224310.1%
 
225310.1%
 
226110.1%
 
ValueCountFrequency (%) 
2699910.1%
 
2699710.1%
 
2696810.1%
 
2695610.1%
 
2693310.1%
 
2691410.1%
 
2689710.1%
 
2686210.1%
 
2684110.1%
 
2682010.1%
 

NumCompaniesWorked
Real number (ℝ≥0)

ZEROS

Distinct10
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.653061224
Minimum0
Maximum9
Zeros138
Zeros (%)13.4%
Memory size8.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median1
Q34
95-th percentile8
Maximum9
Range9
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.508185599
Coefficient of variation (CV)0.9453930333
Kurtosis0.08340350463
Mean2.653061224
Median Absolute Deviation (MAD)1
Skewness1.074605689
Sum2730
Variance6.290994997
MonotocityNot monotonic
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
137736.6%
 
013813.4%
 
210910.6%
 
310510.2%
 
4858.3%
 
7515.0%
 
6494.8%
 
5434.2%
 
9383.7%
 
8343.3%
 
ValueCountFrequency (%) 
013813.4%
 
137736.6%
 
210910.6%
 
310510.2%
 
4858.3%
 
5434.2%
 
6494.8%
 
7515.0%
 
8343.3%
 
9383.7%
 
ValueCountFrequency (%) 
9383.7%
 
8343.3%
 
7515.0%
 
6494.8%
 
5434.2%
 
4858.3%
 
310510.2%
 
210910.6%
 
137736.6%
 
013813.4%
 

Over18
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
Y
1029 
ValueCountFrequency (%) 
Y1029100.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Overview of Unicode Properties

Unique unicode characters1
Unique unicode categories1 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
Y1029100.0%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter1029100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
Y1029100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin1029100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
Y1029100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1029100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
Y1029100.0%
 

OverTime
Boolean

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
No
731 
Yes
298 
ValueCountFrequency (%) 
No73171.0%
 
Yes29829.0%
 

PercentSalaryHike
Real number (ℝ≥0)

Distinct15
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.26044704
Minimum11
Maximum25
Zeros0
Zeros (%)0.0%
Memory size8.2 KiB

Quantile statistics

Minimum11
5-th percentile11
Q112
median14
Q318
95-th percentile22
Maximum25
Range14
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.665779308
Coefficient of variation (CV)0.2402144117
Kurtosis-0.3386498092
Mean15.26044704
Median Absolute Deviation (MAD)2
Skewness0.8012882752
Sum15703
Variance13.43793793
MonotocityNot monotonic
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%) 
1314614.2%
 
1414113.7%
 
1114113.7%
 
1213913.5%
 
15727.0%
 
18595.7%
 
17595.7%
 
19565.4%
 
16525.1%
 
20484.7%
 
22373.6%
 
21302.9%
 
23232.2%
 
25141.4%
 
24121.2%
 
ValueCountFrequency (%) 
1114113.7%
 
1213913.5%
 
1314614.2%
 
1414113.7%
 
15727.0%
 
16525.1%
 
17595.7%
 
18595.7%
 
19565.4%
 
20484.7%
 
ValueCountFrequency (%) 
25141.4%
 
24121.2%
 
23232.2%
 
22373.6%
 
21302.9%
 
20484.7%
 
19565.4%
 
18595.7%
 
17595.7%
 
16525.1%
 
Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
3
865 
4
164 
ValueCountFrequency (%) 
386584.1%
 
416415.9%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Overview of Unicode Properties

Unique unicode characters2
Unique unicode categories1 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
386584.1%
 
416415.9%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number1029100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
386584.1%
 
416415.9%
 

Most occurring scripts

ValueCountFrequency (%) 
Common1029100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
386584.1%
 
416415.9%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1029100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
386584.1%
 
416415.9%
 
Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
3
318 
4
293 
2
224 
1
194 
ValueCountFrequency (%) 
331830.9%
 
429328.5%
 
222421.8%
 
119418.9%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Overview of Unicode Properties

Unique unicode characters4
Unique unicode categories1 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
331830.9%
 
429328.5%
 
222421.8%
 
119418.9%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number1029100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
331830.9%
 
429328.5%
 
222421.8%
 
119418.9%
 

Most occurring scripts

ValueCountFrequency (%) 
Common1029100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
331830.9%
 
429328.5%
 
222421.8%
 
119418.9%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1029100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
331830.9%
 
429328.5%
 
222421.8%
 
119418.9%
 

StandardHours
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
80
1029 
ValueCountFrequency (%) 
801029100.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

Overview of Unicode Properties

Unique unicode characters2
Unique unicode categories1 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
8102950.0%
 
0102950.0%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number2058100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
8102950.0%
 
0102950.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common2058100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
8102950.0%
 
0102950.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII2058100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
8102950.0%
 
0102950.0%
 

StockOptionLevel
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
0
432 
1
417 
2
111 
3
69 
ValueCountFrequency (%) 
043242.0%
 
141740.5%
 
211110.8%
 
3696.7%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Overview of Unicode Properties

Unique unicode characters4
Unique unicode categories1 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
043242.0%
 
141740.5%
 
211110.8%
 
3696.7%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number1029100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
043242.0%
 
141740.5%
 
211110.8%
 
3696.7%
 

Most occurring scripts

ValueCountFrequency (%) 
Common1029100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
043242.0%
 
141740.5%
 
211110.8%
 
3696.7%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1029100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
043242.0%
 
141740.5%
 
211110.8%
 
3696.7%
 

TotalWorkingYears
Real number (ℝ≥0)

Distinct40
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.19630709
Minimum0
Maximum40
Zeros6
Zeros (%)0.6%
Memory size8.2 KiB

Quantile statistics

Minimum0
5-th percentile1
Q16
median10
Q315
95-th percentile28
Maximum40
Range40
Interquartile range (IQR)9

Descriptive statistics

Standard deviation7.85758116
Coefficient of variation (CV)0.7018011469
Kurtosis0.7914308701
Mean11.19630709
Median Absolute Deviation (MAD)5
Skewness1.079672117
Sum11521
Variance61.74158168
MonotocityNot monotonic
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%) 
1013413.0%
 
6928.9%
 
8737.1%
 
5676.5%
 
1666.4%
 
9575.5%
 
4484.7%
 
7454.4%
 
15313.0%
 
3302.9%
 
12292.8%
 
11282.7%
 
21272.6%
 
13272.6%
 
16252.4%
 
2242.3%
 
20232.2%
 
17222.1%
 
18201.9%
 
14191.8%
 
19191.8%
 
23151.5%
 
24141.4%
 
22131.3%
 
2690.9%
 
Other values (15)727.0%
 
ValueCountFrequency (%) 
060.6%
 
1666.4%
 
2242.3%
 
3302.9%
 
4484.7%
 
5676.5%
 
6928.9%
 
7454.4%
 
8737.1%
 
9575.5%
 
ValueCountFrequency (%) 
4010.1%
 
3810.1%
 
3720.2%
 
3640.4%
 
3530.3%
 
3440.4%
 
3340.4%
 
3260.6%
 
3190.9%
 
3040.4%
 

TrainingTimesLastYear
Real number (ℝ≥0)

ZEROS

Distinct7
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.782312925
Minimum0
Maximum6
Zeros39
Zeros (%)3.8%
Memory size8.2 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q33
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.28340125
Coefficient of variation (CV)0.4612713541
Kurtosis0.564671261
Mean2.782312925
Median Absolute Deviation (MAD)1
Skewness0.5580773883
Sum2863
Variance1.64711877
MonotocityNot monotonic
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%) 
238537.4%
 
334633.6%
 
4858.3%
 
5797.7%
 
1504.9%
 
6454.4%
 
0393.8%
 
ValueCountFrequency (%) 
0393.8%
 
1504.9%
 
238537.4%
 
334633.6%
 
4858.3%
 
5797.7%
 
6454.4%
 
ValueCountFrequency (%) 
6454.4%
 
5797.7%
 
4858.3%
 
334633.6%
 
238537.4%
 
1504.9%
 
0393.8%
 

WorkLifeBalance
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
3
626 
2
250 
4
99 
1
 
54
ValueCountFrequency (%) 
362660.8%
 
225024.3%
 
4999.6%
 
1545.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Overview of Unicode Properties

Unique unicode characters4
Unique unicode categories1 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
362660.8%
 
225024.3%
 
4999.6%
 
1545.2%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number1029100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
362660.8%
 
225024.3%
 
4999.6%
 
1545.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Common1029100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
362660.8%
 
225024.3%
 
4999.6%
 
1545.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1029100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
362660.8%
 
225024.3%
 
4999.6%
 
1545.2%
 

YearsAtCompany
Real number (ℝ≥0)

ZEROS

Distinct32
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.942662779
Minimum0
Maximum37
Zeros31
Zeros (%)3.0%
Memory size8.2 KiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median5
Q310
95-th percentile20
Maximum37
Range37
Interquartile range (IQR)7

Descriptive statistics

Standard deviation6.068321865
Coefficient of variation (CV)0.8740625979
Kurtosis3.314188601
Mean6.942662779
Median Absolute Deviation (MAD)3
Skewness1.670784773
Sum7144
Variance36.82453026
MonotocityNot monotonic
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%) 
513713.3%
 
112512.1%
 
3898.6%
 
2888.6%
 
10868.4%
 
4858.3%
 
7565.4%
 
6555.3%
 
9535.2%
 
8504.9%
 
0313.0%
 
11242.3%
 
20232.2%
 
13141.4%
 
15141.4%
 
14121.2%
 
22101.0%
 
18101.0%
 
21101.0%
 
12101.0%
 
1690.9%
 
1980.8%
 
1770.7%
 
3340.4%
 
2440.4%
 
Other values (7)151.5%
 
ValueCountFrequency (%) 
0313.0%
 
112512.1%
 
2888.6%
 
3898.6%
 
4858.3%
 
513713.3%
 
6555.3%
 
7565.4%
 
8504.9%
 
9535.2%
 
ValueCountFrequency (%) 
3710.1%
 
3340.4%
 
3230.3%
 
3130.3%
 
2910.1%
 
2710.1%
 
2630.3%
 
2530.3%
 
2440.4%
 
22101.0%
 

YearsInCurrentRole
Real number (ℝ≥0)

ZEROS

Distinct19
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.21574344
Minimum0
Maximum18
Zeros169
Zeros (%)16.4%
Memory size8.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q37
95-th percentile11
Maximum18
Range18
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.658594827
Coefficient of variation (CV)0.8678409582
Kurtosis0.526143659
Mean4.21574344
Median Absolute Deviation (MAD)3
Skewness0.9622430334
Sum4338
Variance13.38531611
MonotocityNot monotonic
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%) 
226826.0%
 
016916.4%
 
714614.2%
 
3928.9%
 
4757.3%
 
8575.5%
 
9484.7%
 
1424.1%
 
6272.6%
 
5262.5%
 
10222.1%
 
11131.3%
 
13111.1%
 
1490.9%
 
1280.8%
 
1570.7%
 
1650.5%
 
1730.3%
 
1810.1%
 
ValueCountFrequency (%) 
016916.4%
 
1424.1%
 
226826.0%
 
3928.9%
 
4757.3%
 
5262.5%
 
6272.6%
 
714614.2%
 
8575.5%
 
9484.7%
 
ValueCountFrequency (%) 
1810.1%
 
1730.3%
 
1650.5%
 
1570.7%
 
1490.9%
 
13111.1%
 
1280.8%
 
11131.3%
 
10222.1%
 
9484.7%
 

YearsSinceLastPromotion
Real number (ℝ≥0)

ZEROS

Distinct16
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.119533528
Minimum0
Maximum15
Zeros415
Zeros (%)40.3%
Memory size8.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile9
Maximum15
Range15
Interquartile range (IQR)2

Descriptive statistics

Standard deviation3.10865138
Coefficient of variation (CV)1.466667707
Kurtosis3.474949451
Mean2.119533528
Median Absolute Deviation (MAD)1
Skewness1.947710195
Sum2181
Variance9.663713401
MonotocityNot monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%) 
041540.3%
 
124523.8%
 
211511.2%
 
7525.1%
 
4414.0%
 
3333.2%
 
5313.0%
 
6252.4%
 
11161.6%
 
8161.6%
 
9111.1%
 
1370.7%
 
1270.7%
 
1560.6%
 
1050.5%
 
1440.4%
 
ValueCountFrequency (%) 
041540.3%
 
124523.8%
 
211511.2%
 
3333.2%
 
4414.0%
 
5313.0%
 
6252.4%
 
7525.1%
 
8161.6%
 
9111.1%
 
ValueCountFrequency (%) 
1560.6%
 
1440.4%
 
1370.7%
 
1270.7%
 
11161.6%
 
1050.5%
 
9111.1%
 
8161.6%
 
7525.1%
 
6252.4%
 

YearsWithCurrManager
Real number (ℝ≥0)

ZEROS

Distinct17
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.005830904
Minimum0
Maximum17
Zeros188
Zeros (%)18.3%
Memory size8.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q37
95-th percentile10
Maximum17
Range17
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.522573815
Coefficient of variation (CV)0.8793615854
Kurtosis0.283419303
Mean4.005830904
Median Absolute Deviation (MAD)3
Skewness0.8800434637
Sum4122
Variance12.40852628
MonotocityNot monotonic
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%) 
224824.1%
 
018818.3%
 
713513.1%
 
3999.6%
 
8767.4%
 
4716.9%
 
1575.5%
 
9474.6%
 
5262.5%
 
10181.7%
 
6171.7%
 
11161.6%
 
12121.2%
 
1370.7%
 
1760.6%
 
1530.3%
 
1430.3%
 
ValueCountFrequency (%) 
018818.3%
 
1575.5%
 
224824.1%
 
3999.6%
 
4716.9%
 
5262.5%
 
6171.7%
 
713513.1%
 
8767.4%
 
9474.6%
 
ValueCountFrequency (%) 
1760.6%
 
1530.3%
 
1430.3%
 
1370.7%
 
12121.2%
 
11161.6%
 
10181.7%
 
9474.6%
 
8767.4%
 
713513.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

AgeAttritionBusinessTravelDailyRateDepartmentDistanceFromHomeEducationEducationFieldEmployeeCountEmployeeNumberEnvironmentSatisfactionGenderHourlyRateJobInvolvementJobLevelJobRoleJobSatisfactionMaritalStatusMonthlyIncomeMonthlyRateNumCompaniesWorkedOver18OverTimePercentSalaryHikePerformanceRatingRelationshipSatisfactionStandardHoursStockOptionLevelTotalWorkingYearsTrainingTimesLastYearWorkLifeBalanceYearsAtCompanyYearsInCurrentRoleYearsSinceLastPromotionYearsWithCurrManager
050.0NoTravel_Rarely1126.0Research & Development1.02Medical19974Male6634Research Director4Divorced1739966159YNo224380132125413
136.0NoTravel_Rarely216.0Research & Development6.02Medical11782Male8432Manufacturing Director2Divorced494128196YNo20448027033201
221.0YesTravel_Rarely337.0Sales7.01Marketing117802Male3131Sales Representative2Single267945671YNo13328001331010
350.0NoTravel_Frequently1246.0Human ResourcesNaN3Medical16441Male9935Manager2Married1820079991YNo11338013223325107
452.0NoTravel_Rarely994.0Research & Development7.04Life Sciences111182Male8733Healthcare Representative2Single10445153227YNo193480018438640
533.0YesTravel_Rarely1277.0Research & Development15.01Medical15822Male5633Manager3Married13610246197YYes123480015247677
647.0NoTravel_Rarely1001.0Research & Development4.03Life Sciences118273Female9223Manufacturing Director2Divorced10333192718YYes1233801284322111410
722.0NoTravel_Rarely1230.0Research & Development1.02Life Sciences18724Male3322Manufacturing Director4Married4775191466YNo22418024212222
8NaNYesTravel_Rarely890.0Research & Development2.04Medical18283Male4631Research Scientist3Single4382163746YNo17348005322221
933.0NoNon-Travel530.0Sales16.03Life Sciences116813Female3632Sales Executive4Divorced5368161301YYes25438017236512

Last rows

AgeAttritionBusinessTravelDailyRateDepartmentDistanceFromHomeEducationEducationFieldEmployeeCountEmployeeNumberEnvironmentSatisfactionGenderHourlyRateJobInvolvementJobLevelJobRoleJobSatisfactionMaritalStatusMonthlyIncomeMonthlyRateNumCompaniesWorkedOver18OverTimePercentSalaryHikePerformanceRatingRelationshipSatisfactionStandardHoursStockOptionLevelTotalWorkingYearsTrainingTimesLastYearWorkLifeBalanceYearsAtCompanyYearsInCurrentRoleYearsSinceLastPromotionYearsWithCurrManager
101923.0NoTravel_Rarely160.0Research & Development4.01Medical117353Female5131Laboratory Technician2Single3295128621YNo13338003313212
102041.0NoTravel_Rarely1276.0Sales2.05Life Sciences16252Female9134Manager1Married1659556267YNo163280122231816118
102133.0NoNon-Travel750.0Sales22.02Marketing11603Male9532Sales Executive2Married6146154800YNo13318018247707
102249.0NoTravel_Rarely1495.0Research & DevelopmentNaN4Technical Degree114731Male9632Healthcare Representative3Married6651215342YNo143280120023212
102333.0NoTravel_Rarely589.0Research & Development28.04Life Sciences115492Male7932Laboratory Technician3Married5207229491YYes12328011533151457
1024NaNNoTravel_Rarely750.0Research & Development28.03Life Sciences115962Male4642Laboratory Technician3Married3407253481YNo1734802103210968
102541.0NoTravel_Rarely447.0Research & DevelopmentNaN3Life Sciences118142Male8542Healthcare Representative2Single6870155303YNo123180011313212
102622.0YesTravel_Frequently1256.0Research & DevelopmentNaN4Life Sciences112033Male4821Research Scientist4Married285342230YYes11328011530000
102729.0NoTravel_Rarely1378.0Research & Development13.02Other120534Male4622Laboratory Technician2Married4025236794YYes133180110234303
102850.0NoTravel_Rarely264.0Sales9.03Marketing115913Male5935Manager3Married19331195194YYes163380127231000